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Abstract. We examine the spectral profile bound of Goel, Montenegro and Tetali 
for the L°° mixing time of continuous-time random walk in reversible settings. We 
find that it is precise up to a log log factor, and that this log log factor cannot be 
improved. 



1. Introduction 

Of all the formulas suggested in the literature as bounds for the mixing time of 
a finite graph (see e.g. [LK99, MP05, FR07|), possibly the most promising, from a 
geometric point of view, is the spectral profile formula. Introduced by Goel, Mon- 
tenegro and Tetali |GMT06|, it brings into the realm of finite graphs the idea of 
Faber-Krahn inequalities. A Faber-Krahn inequality is an inequality relating the vol- 
ume of a set A and the first eigenvalue of the Laplacian with Dirichlet boundary 
conditions on A — we will give all definitions in the discrete settings below, but for 
the history of the topic, mainly in continuous settings, one should consult BG94I , 



BC01I §VIII.6] or [B01|, which has a somewhat different take on this topic and an 
excellent historical survey. This approach is promising because, as Grigory'an dis- 
covered [G94[, on a general complete manifold it gives sharp estimates on the 
decay of the heat kernel, even in cases where the manifold does not have polynomial 
volume growth. The requirement of polynomial growth was essential in previous 
approaches to this problem, using Sobolev |V85[ or Nash |CKS87| inequalities. 

Let us describe Faber-Krahn inequalities in the discrete settings. We will work 
with weighted, undirected, finite graphs. Let G be such a graph and w:GxG-t 
[0, oo ) the weight function. The heat kernel is defined by 

K(x,y) := w ^ x, y) lo(x) :=y^ lo(x.z). 

v y; co(x) w z 

The heat kernel is a stochastic matrix (i.e. J^yK(x,y) = 1) and hence describes a 
Markov chain on G. The symmetry co(x,y) = co(y,x) gives that it is self -adjoint 
with respect to the stationary measure n defined by 

tv(x) 



7t(x) :- 
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and therefore the associated Markov chain is reversible. It is important to note 
that the results of [GMT06J are not restricted to the reversible case, and apply to 
any finite Markov chain, but in this paper we will restrict our attention to the 
reversible case. The Laplacian, which is an operator on L 2 (G, cv) is defined simply 
as A := I — K and is self-adjoint and positive. 

When A C G is some subset, we will introduce the restricted Laplacian with 
Dirichlet boundary conditions 

^ ^f)( ) otherwise. ^ ^ 

The smallest eigenvalue for will be denoted by \q(A). It is easy to see that 
Aq (A) may also be defined as 

A (A)= inf M|> (2) 

supp/cA 11/ | 

m 

where (f,g) = YLx f( x )s( x ) n ( x ) an< ^ ll/llp = Tlx f( x Y n { x )- It is somewhat more 
elegant to describe the results of [GMT06| with the following quantity instead, 

supp/cA 1 1/| || - I |/| If 

/>o,/^const 

and we will adhere to this convention. Note that as long as n(A) < 1 — e the 
qua ti ties Aq and A are comparable [GMT06. eq. (1.4)]. A Faber-Krahn inequality is 
an inequality of the form A(A) < A(n(A)) for some function A, so the minimal 
function A satisfying this is defined by 

A(r) := inf AM). 

0<7T(A)<!- 

A(r) is the spectral profile. It is defined for all r > := mh\0^A G Q n(A). 

The main result of [ GMT06 [ is a bound for the L°° mixing time of the continuous- 
time random walk in terms of the spectral profile. Let us give the necessary defi- 
nitions. The continuous-time random walk on G is defined using —A as the infin- 
itesimal generator. Explicitly, we define 

oo ,n 

H t = e - t * = e- f Err^(^y) 

n=0 

and think about Ht(x,y) as the probability that a particle doing continuous-time 
random walk on G, starting from x will be at y at time t. Hence we define the 
mixing time using 



Too(e) := inf < t > : sup 

I x,y£G 



H t (x,y) - n{y) 



rc(y) 

We may now state the main result of [GMT06|, 



< e 



2dr 

Too e < / -7TT. 4 
Jin* rA(r) 

In the rest of the discussion we will fix e = J and denote the left hand side by Too 
and the right hand side by p. 
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1.1. The starting point of this short note was the hope that in fact (@J is precise 
in the sense that p < Ctco (Q). This was motivated by the fact that Faber-Krahn 
inequalities give sharp bounds in many interesting manifolds, and by the fact that 
(|4j is in fact sharp under a certain ^-regularity condition (see [GMT06. §3]). And 
in fact, the techniques there give quite easily (and with no regularity assumption), 
the following: 

Theorem 1. For any finite graph G, 

p < CToologlogl/7T*(G). 

Unfortunately, it turns out that this cannot be improved. Indeed we have 

Theorem 2. There exist a sequence n^ — > oo and graphs Gjt of size n^ and = such 
that 

p > CToologlogM^. 

The proof of theorem [2] is also not difficult — the graphs Gj- will be (details in 
section|2]| composed of log log n k pieces H, where each H, corresponds to a distinct 
range of r-s in the integral defining p (0). On the other hand, a random walker 
starting at H, will see only H, — when it finally leaves H, it will already be too 
mixed to notice any effects from the other Hj-s. Thus, perhaps the most natural 
question to ask is 

Question. Is it possible to have the graphs transitive? 

A graph G is transitive if for all x, y £ G there exists an automorphism of the 
graph taking x to y. It would be extremely exciting if the answer to the question 
were to be no. A less exciting, but nonetheless very natural question is as follows. 

Question. Is it possible to have the graphs Gj- unweighted and of uniformly bounded 
degrees? 

Here the rationale for the question is geometric. The analogy between graphs 
and manifolds works best for manifolds with bounded geometry and graphs with 
bounded degrees. Hence there is a certain discord in the fact that the examples 
constructed in theorem|2]are weighted. One would be tempted to solve the ques- 
tion by constructing the graphs (call them g^P 16 ) randomly, namely put an edge 

between x and y in g^P^ with probability lo (x, y) where to is the weight function 
of Gjt- However, more care is needed — applying the recipe above naively would 
immidiately create logarithmic tails that would dominate the mixing time. 

Finally, we remark that in non-reversible settings existing bounds are quite 
weak. For example, it is possible to have p > c\G\ 2 while Too < C\G \ log |G|. A 
careful discussion of this phenomenon can be found in |MT06, examples 5.3-5.5]. 

1.2. Another relevant set of problems revolves around the following: is the mix- 
ing time a geometric property? This is particularly interesting since many results 
in mixing have been achieved using representation theory (|BD92| is probably the 



Here and below we use C and c to denote absolute positive constants that may be different from 
place to place. C will be used for constants which are "large enough" and c for constants which are 
"small enough". The notation / f» g will stand for cf < g < Cf. 
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most famous) or using coupling (e.g. [LRS01]), techniques which are better de- 
scribed as "algebraic" rather than "geometric". To make the question formal let us 
define the notion of a rough isometry. 

Definition. Let X and Y be metric spaces and let / : X — > Y be a function and let 
K £ (0,oo). We say that / is a X-rough isometry if the following two properties 
hold: 

(1) For any a,b € X, 

^d{a,b)-K< d(f(a),f(b)) < Kd{a,b)+K. 

(2) For any y G Y there exists an x E X such that 

d(f(x),y)<K. 

To use this for Markov chains we will restrict ourselves to the simplest settings, 
that of random walk on a (unweighted) graph with bounded degree. In this case 
the graph has a natural metric structure given by the path metric, i.e. the distance 
d{v,w) is defined to be the length of the shortest path between v and 10. And we 
ask: is the mixing time invariant to rough isometries? Formally: 

Conjecture. Let G, H be two graphs with degG, degH < d. Let f : G — > H be a 
K-rough isometry in the path metrics on G and H. Then 

t(G) < C(K,d)r(H). (5) 

Since a rough isometry is reversible, this would in fact imply that t(G) ~ t(H). 

It is an interesting observation that all approximations for the mixing time I am 
aware of are rough isometry invariants. It is easy to see that isoperimetric inequal- 
ities are rough-isometry invariants, and hence both the Lovasz-Kannan integral 
[LK99J and the Fountoulakis-Reed integral [FR07J (which bounds the L 1 mixing 
time rather than our Too, but the conjecture is just as relevant for Tj) are rough- 
isometry invariants. To see that, for example, the spectral gap is a rough isometry 
invariants one has to define it using functional inequalities i.e. |3]l — note that the 
spectral gap is exactly A(G) — and then it becomes easy to check that the spec- 
tral gap and the spectral profile are both rough isometry invariants. Thus a precise 
bound of this style for the spectral gap would probably imply the conjecture. 

In particular, combining [GMT06. theorem 1.1] with theorem [1] gives a weaker 
form of {5): 

Too(G) < C(X,i)(loglog|G|)T co (H). (6) 
This result, however, is not new. Indeed, Too is comparable to the best constant in 
the logarithmic Sobolev inequality a defined by 

in the sense that 

c Clog log 1//T* 

^ Too S 



a a 

See |MT06| for historical background, the definition of the entropy Ent^ and for 
the equivalence (theorem 4.13 ibid). It is easy to see that a is a rough isometry 
invariance hence this gives another derivation of ((6). 

We end this discussion with an observation of Itai Benjamini, that the mixing 
time from a given point is not a rough isometry invariant. Thus, for example, the 
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mixing time from the root of a binary tree of height h is « h. However, the mixing 
time from a neighbor of the root is « 2 h (see [AFI chapter 5] for both). Since 
there is a rough isometry of a tree on itself carrying the root to a neighbor, this 
demonstrates the claim. 

I wish to thank Laszlo Lovasz and Prasad Tetali for many useful discussions. 
This material is partially based upon work supported by the National Science 
Foundation under agreement DMS-01 11298. Any opinions, findings and conclu- 
sions or recommendations expressed in this material are mine and do not neces- 
sarily reflect the views of the National Science Foundations 



2. Proofs 

Proof of theorem^ Denote by A k a Rayleigh set of measure 2~ k i.e. n{A k ) < 2~ k 
and 

X(A k ) = min{A(S) : tt(S) < 2~ ,c }. 
Where A is from ((3}. It is easy to see that 

Ll0g 2 l/7T»J ^ 

On the other hand, by IGMT061 lemma 3.1], for any k>2 

v > > 77TT (7) 

As for k — 1, we have l/A(Ai) < 1/A(G) but A(G) is just the spectral gap and 
hence 1/A(G) < Ctoo IMT06I theorem 4.9]. Hence holds for it = 1 as well. 
Therefore 

Ll0g 2 l/7T,J Ct 

P< T ~ Tl Oglogl/7T*. □ 

Proof of theorem^ We may assume w.l.o.g. that k is sufficiently large. We define 
simply 

n k =(k- [logic] +l)2 2k 
where log, here and below, is to base 2. The graph will consist of k — [logic] + 1 
pieces, which we denote by Hp log k i , H k , each with 2 lk vertices. We can already 
define the weight function between the H/s: for every v\, v 2 in different Hjs we set 

k2- k 

07(Ul,U 2 ) = -|^|-. 

Let Aj be a set of vertices of size 2 and B/ a set of size 2 . Setwise we define 
Hi — A\ x B; and then define the weight function co as follows: 



co({a 1 ,b 1 ),{a 2 , b 2 )) 



'm 2l ^ + n kl ~ k b ^ b > 



With this definition of CO we would have that to(v) = 1+2' k + ic2 * for every 
v E Hi. The inhomogeneity of to is somewhat bothersome so we modify co(y,v) 
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to fix this, writing 

co{v, v) = (1 - 2 l - k ) + -L + 7^2'-* + ^fcr* to G H, 
I ^/ 1 I "/| l^l 

with the result being that = 2 + fc2 _J: for all v. 

With our graph G defined we can start investigating its properties. We first 
estimate the spectral profile p. By the discrete inverse Cheeger inequality |AM85. 
lemma 2.1], for any set S, 

A(s) <c^ = c^. 

7r(S) Ctf(S) 

where we consider the weight function w as a measure which is a constant mul- 
tiple of 71. We use it for the set A\ x {pt} which we confusingly call A/. Now, 
co{Ai) ps | A ; | = 2 2k ~ 2 ' . There are two types of edges coming out of A ; , edges to 
H; and edges to the other H,s. The first type has weight 

j^2 l - k + j^k2- k = j^2 l -\l + o(l)) 

where the o notation above and also below means "as k — > oo, uniformly in / £ 

[logk,k]". so, after summing over all couples (vi,Vi), v\ 6 A\ and Vi G H; \ 

A; gives a total contribution < \A{\2 l ~ k {\ + o(l)). The second type has weight 
l 

M 

contribution < | A; |fc2 Since / > log k we get 



— kl k so after summing over all [v\, vi), V\ G A; and c>2 G G\H{ gives a total 



A (A ; ) < C^£> < C2'"*. (8) 

Now, 7r(A/) = |A ; |/|G| = l/(k- [logfc] +1)2 2 '. For brevity denote e = l/(fc- 
[logfc] + 1). We get that, 

dv 1 ft/ 22 '' 1 dv 1 , © 

> ft- / , — > ■ c2 l > c2 k . 



le/2* vA(v) - A(e/2 2 ') Je/2*' v ~ A(A ; ) 
Summing we get 

The proof will be finished once we show that r < C2 k . 

Let us therefore investigate the random walk on G. It will be convinient to 
represent it as follows. Assume the walker is at a vertex v G H;. We first throw 
a coin which has probability k/2 k to(v) of success. Call the event that this throw 
succeeded £j and in this case choose one of the vertices of G randomly with equal 
probability and move there. If £i did not occur, throw a second coin which has 
probability 



co(v) -k2~ 

to succeed. Call the event that this throw succeeded £2 ar *d in this case choose 
one of the vertices of H; randomly with equal probability and move there. Finally, 
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throw a coin with probability 

1 1 

0)[v) - k2- k - 2 l ~ k ~ 2 - 2 l ~ k 

to succeed (if I = k it always does). Call the event that this throw succeeded £3 
and in this case choose one of the vertices of the copy of A; containing v randomly 
with equal probability and move there. If none of £1, £2 and £3 succeeded, stay 
at v. It is easy to see that this is equivalent to the walk on the graph (in fact, we 
defined the weights on the graph with this representation in mind). 

Examine a random walk of length 2 k+1 , starting from some V E H/. Let zv £ G 
and examine P(R(2* C+1 ) = zv) (that starting point will always be V — we will not 
remind this fact in the notation). We first note that after an event of type £1 the 
walk is completly mixed. Define T\ to be the first time when £1 occurred, which is 
a stopping time. Using the strong Markov property we get, for every t < 2 k+1 , 

p({ti = *} n {R(2 k+1 ) = zv}) = p( Tl = f) 

and summing over t gives 

P({Ti < 2 k+1 } n {R(2 k+1 ) = zu}) = P( Tl < 2 k+1 ) < p. (9) 
Now, the event X\ > 2 k+1 can be estimated simply using 

which immediately gives a lower bound P(R(2 fc+1 ) = w) > (1 - o(l))/\G\ valid 
for all zv. Further, if zv £ Hi then 10 gives an upper bound, since one cannot reach 
from v to zv without a £1 event. Hence we will henceforth assume w G H; and 
Tx > 2 k+1 . The estimate JTOt is nice, but far from our goal of • 

After an event of type £2 the walk is totally mixed in H;. Therefore if we define 
T2 to be the first time when £2 occurred then a similar calculation to the above 
shows that 

P({Ti > 2 k+1 } n {t 2 < 2 k+1 } n {R{2 k+1 ) = »}) = 

1 CD) 1 

P(tj >2 fc + 1 )P(x 2 <2 /: + 1 |T 1 >2 ,c+1 )-^ < e" 2 ^ 3 ^ 

I-"/ 1 I-"/ 

With lO we have 

P({min{T 1 ,T 2 } <2 k+1 }n{R(2 k+1 ) = zv}) < ^(1+ (1)). (11) 
Again we note the probability that T2 > 2 k+1 : 




P(min{T 1; T 2 } > 2 k+1 ) < (l-2 , - k - i y k+1 < e" 2 ' 



(12) 
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For the last part we assume iv 6 A\ where here A\ is the copy of A\ containing v. 
We define T3 as the first time £3 occurred and get 

P({min{T 1/ T 2 } > 2 k+1 } n {t 3 < 2 k+1 } R {R{2 k+1 ) = w}) = 

P({min{T 1/ T 2 } > 2 Jr+1 })P(T 3 < 2 k+1 | min{T : ,T 2 } > 2 ^ +1 )^j ^ 

^■^-'tkY <13) 

Finally, in the case that X\, T 2 , T3 > 2 k+1 (so w must be v) we definitely have 

PCnrinH,^} > 2 k+1 ) <(l- j^p-^j 1 < 2~ 2k+1 = (J-^ . (14) 
Summing up ifTTT l, | fT3l l and l(l4l l we finally get 

F*(R(2 k+t )=w)< |1(1 + (1)) 

and hence for sufficiently large, t < 2 fc+1 . This ends the theorem. □ 
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